Employing Inductive Databases in Concrete Applications

نویسندگان

  • Rosa Meo
  • Pier Luca Lanzi
  • Maristella Matera
  • Danilo Careggio
  • Roberto Esposito
چکیده

In this paper we present the application of the inductive database approach to two practical analytical case studies: Web usage mining in Web logs and financial data. As far as concerns the Web domain, we have considered the enriched XML Web logs, that we call conceptual logs, produced by specific Web applications. These ones have been built by using a conceptual model, namely WebML, and its accompanying CASE tool, WebRatio. The Web conceptual logs integrate the usual information about user requests with meta-data concerning the Web site structure. As far as concerns the analysis of financial data, we have considered the trade stock exchange index Dow Jones and studied its component stocks from 1997 to 2002 using the so-called technical analysis. Technical analysis consists in the identification of the relevant (graphical) patterns that occur in the plot of evolution of a stock quote as time proceeds, often adopting different time granularities. On the plots the correlations between distinctive variables of the stocks quote are pointed out, such as the quote trend, the percentage variation and the volume of the stocks exchanged. In particular we adopted candle-sticks, a figurative pattern representing in a condensed diagram the evolution of the stock quotes in a daily stock exchange. In technical analysis, candle-sticks have been frequently used by practitioners to predict the trend of the stocks quotes in the market. We then apply a data mining language, namely MINE RULE, to these data in order to identify different types of patterns. As far as Web data is concerned, recurrent navigation paths, page contents most frequently visited, and anomalies such as intrusion attempts or a harmful usage of the resources are among the most important patterns. As far as concerns the financial domain, we searched for the sets of stocks which frequently exhibited a positive daily exchange in the same days, so as to constitute a collection of quotes for the constitution of the customers’ portfolio, or the candle-sticks frequently associated to certain stocks, or finally the most similar stocks, in the sense that they mostly presented in the same dates the same typology of candle-stick, that is the same behaviour in time. J.-F. Boulicaut et al. (Eds.): Constraint-Based Mining, LNAI 3848, pp. 295–327, 2005. c © Springer-Verlag Berlin Heidelberg 2005

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

From Extensional to Intensional Knowledge: Inductive Logic Programming Techniques and Their Application to Deductive Databases

This chapter aims at demonstrating that inductive logic programming (ILP), a recently established subfield of machine learning that induces first-order clausal theories from examples, combines very well with the area of deductive databases. In the context of deductive databases, induction can be defined as inference of intensional knowledge from extensional knowledge. Classification-oriented IL...

متن کامل

An Ilp - Based Concept Discovery System for Multi - Relational Data Mining

AN ILP-BASED CONCEPT DISCOVERY SYSTEM FOR MULTI-RELATIONAL DATA MINING Kavurucu, Yusuf Ph.D., Department of Computer Engineering Supervisor : Asst. Prof. Dr. Pınar Şenkul July 2009, 118 pages Multi Relational Data Mining has become popular due to the limitations of propositional problem definition in structured domains and the tendency of storing data in relational databases. However, as patter...

متن کامل

Ontology-based Data Quality enhancement for Drug Databases

This paper proposes a solution to enhance data quality features in drug databases. The method we are using is based on the integration of controlled terminologies for the drug domain. This approach requires several steps : (1) transforming the terminologies into a Semantic Web compliant Description Logics, namely OWL DL, (2) associating new axioms to concepts of these ontologies based on induct...

متن کامل

Concrete application of learning theories in public health

Introduction: Educational psychology researchers have examined learning from different perspectives and their findings for explaining learning phenomena have led to different theories.Each of these theories can provide principles, rules, and specific requirements for classrooms.This study aimed to provide some examples of applications of learning theories in public health. Methods: Some databa...

متن کامل

The Molecular Feature Miner Molfea

Inductive databases are a new generation of databases, that are capable of dealing with data but also with patterns or regularities within the data. A user can generate, manipulate and search for patterns of interest using an inductive query language. Data mining then becomes an interactive querying process. The inductive database framework is especially interesting for bioand chemoinformatics,...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004